• Big Data Pipelines on the Computing Continuum: Ecosystem and Use Cases Overview 

      Roman, Dumitru; Nikolov, Nikolay; Soylu, Ahmet; Elvesæter, Brian; Song, Hui; Prodan, Radu; Kimovski, Dragi; Marrella, Andrea; Leotta, Francesco; Matskin, Mihhail; Ledakis, Giannis; Theodosiou, Konstantinos; Simonet-Boulogne, Anthony; Perales, Fernando; Kharlamov, Evgeny; Ulisses, Alexandre; Solberg, Arnor; Ceccarelli, Raffaele (Proceedings of the IEEE Symposium on Computers and Communications;2021 IEEE Symposium on Computers and Communications (ISCC), Peer reviewed; Journal article, 2021-12-15)
      Organisations possess and continuously generate huge amounts of static and stream data, especially with the proliferation of Internet of Things technologies. Collected but unused data, i.e., Dark Data, mean loss in value ...
    • Big Data Pipelines on the Computing Continuum: Tapping the Dark Data 

      Roman, Dumitru; Prodan, Radu; Nikolov, Nikolay; Soylu, Ahmet; Matskin, Mihhail; Marrella, Andrea; Kimovski, Dragi; Elvesæter, Brian; Simonet-Boulogne, Anthony; Ledakis, Giannis; Song, Hui; Leotta, Francesco; Kharlamov, Evgeny (Computer;Volume: 55, Issue: 11, Peer reviewed; Journal article, 2022-10-25)
      Big Data pipelines are essential for leveraging Dark Data, i.e., data collected but not used and turned into value. However, tapping their potential requires going beyond existing approaches and frameworks for Big Data ...
    • Big data workflows: Locality-aware orchestration using software containers 

      Corodescu, Andrei-Alin; Nikolov, Nikolay; Khan, Akif Quddus; Soylu, Ahmet; Matskin, Mihhail; Payberah, Amir H.; Roman, Dumitru (Sensors;Volume 21 / Issue 24, Peer reviewed; Journal article, 2021-12-08)
      The emergence of the Edge computing paradigm has shifted data processing from centralised infrastructures to heterogeneous and geographically distributed infrastructures. Therefore, data processing solutions must consider ...
    • Conceptualization and scalable execution of big data workflows using domain-specific languages and software containers 

      Nikolov, Nikolay; Dessalk, Yared Dejene; Khan, Akif Quddus; Soylu, Ahmet; Matskin, Mihhail; Payberah, Amir; Roman, Dumitru (Internet of Things;Volume 16, December 2021, 100440, Peer reviewed; Journal article, 2021-11-26)
      Big Data processing, especially with the increasing proliferation of Internet of Things (IoT) technologies and convergence of IoT, edge and cloud computing technologies, involves handling massive and complex data sets on ...
    • Container-Based Data Pipelines on the Computing Continuum for Remote Patient Monitoring 

      Nikolov, Nikolay Vladimirov; Solberg, Arnor; Prodan, Radu; Soylu, Ahmet; Matskin, Mihhail; Roman, Dumitru (Computer;, Peer reviewed; Journal article, 2023)
      Diagnosing, treatment, and follow-up care of patients is happening increasingly through telemedicine, especially in remote areas where direct interaction is hindered. Over the past three years, following the COVID-19 ...
    • ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER 

      Layegh, Amirhossein; Hossein Payberah, Amir; Soylu, Ahmet; Roman, Dumitru; Matskin, Mihhail (IEEE Annual International Computer Software and Applications Conference (COMPSAC);, Chapter; Peer reviewed; Conference object, 2023)
      Prompt-based language models have produced encouraging results in numerous applications, including Named Entity Recognition (NER) tasks. NER aims to identify entities in a sentence and provide their types. However, the ...
    • DataCloud: Enabling the Big Data Pipelines on the Computing Continuum 

      Roman, Dumitru; Nikolov, Nikolay; Elvesæter, Brian; Soylu, Ahmet; Prodan, Radu; Kimovski, Dragi; Marrella, Andrea; Leotta, Francesco; Benvenuti, Dario; Matskin, Mihhail; Ledakis, Giannis; Simonet-Boulogne, Anthony; Perales, Fernando; Kharlamov, Evgeny; Ulisses, Alexandre; Solberg, Arnor; Ceccarelli, Raffaele (Lecture Notes in Business Information Processing;Volume 415, Conference object, 2021-05)
      With the recent developments of Internet of Things (IoT) and cloud-based technologies, massive amounts of data are generated by heterogeneous sources and stored through dedicated cloud solutions. Often organizations generate ...
    • Locality-Aware Workflow Orchestration for Big Data 

      Corodescu, Andrei-Alin; Nikolov, Nikolay; Khan, Akif Quddus; Soylu, Ahmet; Matskin, Mihhail; Payberah, Amir; Roman, Dumitru (MEDES: Management of Emergent Digital EcoSystems;MEDES '21: Proceedings of the 13th International Conference on Management of Digital EcoSystems, Conference object, 2021-11-09)
      The development of the Edge computing paradigm shifts data processing from centralised infrastructures to heterogeneous and geographically distributed infrastructure. Such a paradigm requires data processing solutions that ...
    • A Taxonomy for Cloud Storage Cost 

      Khan, Akif Quddus; Nikolov, Nikolay Vladimirov; Matskin, Mihhail; Prodan, Radu; Bussler, Christoph; Roman, Dumitru; Soylu, Ahmet (Communications in Computer and Information Science (CCIS);, Chapter; Peer reviewed; Conference object; Journal article, 2024)
      The cost of using cloud storage services is complex and often an unclear structure, while it is one of the important factors for organisations adopting cloud storage. Furthermore, organisations take advantage of multi-cloud ...
    • Towards Cloud Storage Tier Optimization with Rule-Based Classification 

      Khan, Akif Quddus; Nikolov, Nikolay Vladimirov; Matskin, Mihhail; Prodan, Radu; Bussler, Christoph; Roman, Dumitru; Soylu, Ahmet (Chapter; Peer reviewed; Conference object; Journal article, 2023)
      Cloud storage adoption has increased over the years as more and more data has been produced with particularly high demand for fast processing and low latency. To meet the users’ demands and to provide a cost-effective ...
    • TRANSQLATION: TRANsformer-based SQL RecommendATION 

      Tahmasebi, Shirin; Payberah, Amir H.; Soylu, Ahmet; Roman, Dumitru; Matskin, Mihhail (IEEE International Conference on Big Data;, Chapter; Peer reviewed; Conference object, 2023)
      The exponential growth of data production emphasizes the importance of database management systems (DBMS) for managing vast amounts of data. However, the complexity of writing Structured Query Language (SQL) queries requires ...